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Abstract 

Although many authors have considered how many ternary comparisons it takes to 
I— j sort a multiset S of size n, the best known upper and lower bounds still differ by a 

term linear in n. In this paper we restrict our attention to online stable sorting and 
i—i prove upper and lower bounds that are within o(n) not only of each other but also 

of the best known upper bound for offline sorting. Specifically, we first prove that if 
j> the number of distinct elements a = o(n/ log n), then (H + l)n + o(n) comparisons 

are sufficient, where H is the entropy of the distribution of the elements in S. We 
then give a simple proof that {H + l)n — o(n) comparisons are necessary in the 



O worst case. 
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1 Introduction 



Comparison-based sorting is perhaps the most studied problem in computer 
science, but there remain basic open questions about it. For example, how 
many ternary comparisons are needed to sort a multiset S of size n? By a 
ternary comparison, we mean one that can return <, = or >; we count only 
comparisons between elements of the multiset, not those between data gen- 
erated by the algorithm. Over thirty years ago, Munro and Spira [TT] proved 
distribution-sensitive upper and lower bounds that differ by C(nloglogcr), 
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Table 1 

Bounds on the number of ternary comparisons needed to sort a multiset offline. 





upper bound 


lower bound 


Munro and Raman 10) 




(H - loge)n + O(logn) 


Fischer [3] 


(H + l)n-a 


{H- log H)n-0{n) 


Dobkin and Munro [2] 




(h nlog(logn ^* occfa - S) w logocc(0i ' S) ))n 0(n) 


Munro and Spira |11J 


nH + 0{n) 


nH — (n — a) log log a — 0(n) 



where a is the number of distinct elements in S. Their bounds have been 
improved in a series of papers — summarized in Table ffl - - and now the 
best known upper and lower bounds differ by a term linear in n (about 
(1 + loge)n « 2.44n when o = o(n)). In this paper we restrict our atten- 
tion to online stable sorting and prove distribution-sensitive upper and lower 
bounds that are within o(n) not only of each other but also of the best known 
upper bound for offline sorting. When a = o(n/ log n) we can now say, for 
example, exactly how many comparisons on average per element are needed 
to sort online and stably as n goes to infinity. 

Stable sorting algorithms preserve the order of equal elements, and we define 
online sorting algorithms to be those that process S element by element and 
keep those already seen in sorted order. We believe our definition is reasonable 
because, if we want a sorting algorithm to be stable and each comparison it 
makes to involve the most recently read element — to prevent it delaying 
all its decisions until it has read all of S — then it must determine each 
element's rank and distinctness among those already seen, before reading the 
next element. To see why, suppose the most recently read element is x and, for 
some element y already seen, the algorithm reads the next element without 
determining whether x < y, x = y, or x > y. Without loss of generality, 
assume x < y. If all the remaining elements are either strictly less than x 
or strictly greater than y, then the algorithm has no further opportunity to 
determine x and y's relative order. Notice that, since the algorithm has no 
information about an element until it has been involved in a comparison, it 
makes no difference in the worst case whether the algorithm chooses the order 
in which the elements are to be read, or an adversary does. 

Splay-sort, for example — sorting by insertions into a splay-tree [13] - is 
both online and stable. It uses 0((H + l)n) comparisons and time, where 
H = Ylf=i occ ^' s ^ log oce ^ m is the entropy of the distribution of the elements, 
ai, . . . , di are the distinct elements, occ (a^, 5") is the number times occurs in 
S, and log means log 2 . Such bounds are called distribution-sensitive because 
they depend not only on n but also on the distribution of elements. It is not 
hard to show splay-sort's bound is within a constant factor of the optimal 
offline bound: we can encode S by recording the result of each comparison in 
a constant number of bits, together with an O{o log n)-bit table containing 
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the distinct elements' frequencies; we generally cannot encode S in fewer than 
nH bits; we must also use at least one comparison per element; therefore, we 
must use Q((H + l)n) comparisons. Notice that, since we can implement a 
ternary comparison as two binary comparisons — which can return only < or 
> — and asymptotic notation can hide a two-fold increase, splay-sort's bound 
holds for both types of comparison. In contrast, our results in this paper show 
that ternary comparisons are slightly more powerful. 

Our starting point is another online stable sorting algorithm, due to Fred- 
man |4] and based on a result by Gilbert and Moore [BJ. In Section [2] we 
show how to use this algorithm to sort S online and stably using at most 
(H + 2)n + o(n) binary comparisons and 0(\oga) time per comparison when 
a = o{nj logn), or constant time per comparison when a = o {^Jn/ log n). In 
Section [3] we give a simple proof that (H + 2)n — o(n) binary comparisons are 
necessary in the worst case. In Section [4] we turn our attention to sorting with 
ternary comparisons and show how to modify Fredman's algorithm to use at 
most (H + l)n + o(n) ternary comparisons when a = oinj logn), by replacing 
Gilbert and Moore's result with one by Mehlhorn [9]; the time bounds remain 
the same. In Section [5] we modify our lower bound to show (H + l)n — o(n) 
ternary comparisons are necessary in the worst case. Notice our upper and 
lower bounds here are within o(n) not only of each other but also of the best 
known upper bound for offline sorting, (H + l)n — a, due to Fischer [3] and 
shown in the second row of Table [TJ We hope studying online sorting will ulti- 
mately provide insights that will help narrow the gap between the upper and 
lower bounds for offline sorting. Since sorting is fundamental, studying online 
sorting may even provide insights into online versions of other problems. We 
must admit, the main insights we use in this paper came not from studying 
the literature about sorting — although they easily could have — but from 
writing a recent paper [5] about adaptive prefix coding. 



2 Upper bound on binary comparisons 

Fredman [I] showed how, given an n-tuple (x\, . . . ,x n ) drawn from a uni- 
verse U, we can determine its components one by one using a total of at 
most log \ U\ +2n binary comparisons. To determine x^, Fredman's algorithm 
restricts its attention to the candidate n-tuples consistent with xi, . . . ,x,_i; 
counts how often each value occurs as the ith component in these candidates; 
uses the following theorem by Gilbert and Moore [6J to build a leaf-oriented 
binary search tree for the values' normalized frequencies; and inserts X{ into 
that tree. The key observation is that, if the algorithm uses c comparisons to 
determine x^, then fewer than a (l/2 c ~ 2 )-fraction of the candidates consistent 
with xi, . . . , are also consistent with X{. 
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Theorem 1 (Gilbert and Moore, 1959) Given a probability distribution 
Pi, ■ ■ ■ ,Pk containing only positive probabilities, we can build an ordered binary 
tree whose leaves are at depths at most |~log(l/pi)] + 1, . . . , |~log(l/pfc)] + 1. 

Fredman's result immediately implies that, given the frequencies of the distinct 
elements in S, we can sort it online and stably using at most 



log (occ (ai, S) , . . . , occ (a a , Sfj ! + 2n 



comparisons, where (occ (a±, S) , . . . , occ (a a , S))\ = n\j V&=\ occ (a*, S)\ is the 
number of strings in which each element appears occ (a», S) times. By the 
following lemma, this bound is at most (H + 2)n + o(n). 

Lemma 2 log (occ (at, S) , . . . , occ (a a , S))\ < nH + O(logn). 



PROOF. Robbins' extension [12] of Stirling's Formula, 

y^ x x+l/2 e -x+l/(12x-+l) < x \ < v /2^ a , :C +l/2 e -:r+l/(12x) 

implies 

x log x — x log e < log x\ < x log x — x log e + 0(log x) . 
Therefore, since J2i occ S) = n, straightforward calculation shows that 

a 

log (occ (ai, 5) , . . . , occ (a CT , S 1 )^ ! = logn! — logocc (a^, S)\ 

i=i 

is at least ni!f — C?(crlog(n/cr)) and at most nif + 0(logn). □ 



Of course, given the frequencies of the distinct elements in S, we can use The- 
orem [T] to build a single leaf-oriented binary search tree whose leaves store 
the distinct elements in S, with each distinct element aj at depth at most 

log(n/occ (aj, S)) 
total of at most (H 



we can then insert each element into that tree using a 
2)n+o(n) comparisons. We are interested in Fredman's al- 
gorithm because we can use it to sort S using (H+2)n+o(n) comparisons even 
when we are not given the frequencies in advance, as long as o = o(n/ log n). 
To prove this, we start by modifying Theorem [l] 

Lemma 3 Given a probability distribution Pi, ■ ■ ■ ,Pk containing only positive 
probabilities, in 0{k) time we can build an ordered binary tree on2k + l leaves 
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with height at most log(l/ mmi<i< k {pi}) + 1 whose even-numbered leaves 
are at depths at most |~log(l/pi)] + 1, . . . , |~log(l/pfc)] + 1. 



PROOF. For 1 < i < 2k + 1, consider the ith term in the sequence 



o, TpPi, [pi + ,(pi+Pz),---, [J2pj + y ] >!■ 



P2 



If i is even, then the ith term differs from every other term by at least pj/2/2, 
so its binary representation bi differs from all of theirs in one of its first 

log(l/pi/2) + 1 bits after the binary point. Therefore, in a trie containing 
the shortest prefixes of 61, ... , &2fc+i necessary to uniquely identify them, the 
even- numbered leaves are at depths at most |~log(l/pi)] +1, . . . , |~log(l/pfc)] +1; 
it follows that the odd-numbered leaves are all at depths at most 

log(l/ mmi<i< k {pi}) + 1. Since such a tree exists, we can use an algorithm 
by Kirkpatrick and Klawe [7j to build one in O(k) time. □ 



We need not do anything to process the first element, S[l], since any singleton 
set is trivially sorted. (For convenience, we write S[i] to indicate the ith ele- 
ment in S and >S[z..j] to indicate the ith through jth elements.) Assume for the 
moment that we are not concerned with how much time we use to sort S. To 
process S[i] for i > 2, we first apply Lemma [3] to the probability distribution 
occ (ax, S[l..(i — 1)]) /(«— 1), • • • , occ (a k ,S[l..(i — 1)]) /(i— 1), where a±, . . . ,a k 
are the distinct elements in S[l..(i — 1)]. Let Tj be the resulting tree. We label 
Tj's even-numbered leaves with ai, . . . , a k and its odd-numbered leaves with 
blanks; at each internal node, we store the labels of the rightmost leaf in that 
node's left subtree and the leftmost leaf in its right subtree, one of which is 
always a blank. Finally, we insert S[i] into Tj as follows: at each internal node 
that stores a blank and a key aj, in that order, we check whether aj < S[i] 
and descend to the right if so and to the left if not; at each internal node that 
stores a key aj and a blank, in that order, we check whether S[i] < aj and 
descend to the left if so and to the right if not. If S[i] is an occurrence of some 
a, that has already occurred, then we eventually reach the jth even-numbered 
leaf and, thus, use at most |~log((z — l)/occ (aj, S[l..(i — 1)]))] +1 comparisons. 
On the other hand, if S[i] is not equal to any previous character and has rank 
j in S[l..(i — 1)], then we eventually reach the jth odd-numbered leaf and, 
thus, use at most |~log(z — 1)] +1 comparisons. 

Theorem 4 We can sort S online and stably using at most (H + 2)n + o(n) 
binary comparisons when a = o(n/\ogn). 
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PROOF. Since the multiset 

{occ{S[i),S[l..{i-l))) : l<i<n}= \J {0, . . . , occ (a a , S) - 1} 



we use at most 



E 



log 



i - 1 



max 



+ 1 



(occ(S\i],S[l..(i-l)]),l] 

n n 

<^log(i- 1) -^logmax(occ(5'[i] ) 5[l..(i- l)]),l) + 2n 



i=l 



i=l 



= \og(n - 1)! - log(occ (dj, S) - 1)! + 2n 

3=1 

a 

< log n! — log occ (a j, S)\ + 2n + O{o log n) 

= log (^occ (ai, S) , . . . ,occ (a CT , S 1 )^! + 2n + O(alogn) 

comparisons. By Lemma [2j this is at most (H + 2)n + O(alogn) and, thus, 
(H + 2)n + o{n) when a = o{n/ log n). □ 



To make our modification of Fredman's algorithm run in (9(logcr) time per 
comparison rather than 0(a) time, we do not actually build the trees T 2 , . . . , T n 
but, instead, maintain an augmented AVL-tree [T] that stores the partial sums 
of the frequencies of the distinct elements seen so far. Since each Tj is defined 
by partial sums (recall the proof of Lemma [3]), we can use this AVL-tree as 
an implicit representation of 7] when processing S[i}. In particular, it is not 
difficult to use it to compute the labels stored at each internal node of in 
(D(\oga) time. 



If a — o y\Jn/ lognj, on the other hand, then we can use 0(1) time per com- 
parison. In this case, we build only a few of the trees T 2 , . . . , T n . Specifically, 
we build a new tree whenever the number of elements processed since we built 
the last tree is equal to the number of distinct elements we had processed at 
that time; it follows that we need only 0(1) amortized time per element. Since 
we want the tree into which we insert S[i] always to contain all the distinct 
elements in S[l..(i — 1)], at each odd-numbered leaf we store a pointer to an 
AVL-tree. If we reach an odd-numbered leaf when processing S[i], then we in- 
sert S[i] into the corresponding AVL-tree in the standard way, implementing 
each ternary comparison as two binary comparisons. Calculation shows we use 
a total of at most 
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max(occ(S[i],S[l..(i-l)])-a,l) J V 

V log(occ(a i)1 S)- ( j-l)! + 2n + c( ( T 2 loga) 




< logn! — 



<7 



< log n\-J2 lo S occ ( a i> S)\ + a log n a+1 + 2n + 0(a 2 log tr) 



< (if + 2)n + C(ff 2 logn) 



comparisons and, thus, (ff + 2)n + o(n) when a = o yjnj 'lognj. Notice we 
subtract a from occ (<S'[i], S[l..(i — 1)]) in the formula above because we always 
build the tree with which we process S[i] sometime after processing S[i — a]. 

3 Lower bound on binary comparisons 

Consider any online stable sorting algorithm that uses only binary compar- 
isons to determine the stably sorted order of the elements in S. Since it is 
online, it must determine each element's rank relative to the distinct elements 
already seen before moving on to the next element. Since it uses only binary 
comparisons, we can view its strategy for each element as a binary decision tree 
whose even-numbered leaves are labelled with the distinct elements already 
seen and whose odd-numbered leaves correspond to the intervals in which the 
next distinct element could lie. (We consider its strategy as a decision tree 
rather than a search tree because we do not want to specify what comparisons 
it makes at internal nodes.) If the current element has been seen before, then 
the algorithm reaches the even-numbered leaf labelled with that element in 
the decision tree; if not, then it reaches an odd-numbered leaf. In both cases, 
the number of comparisons the algorithm uses is at least the depth of the leaf. 

Suppose an adversary starts by presenting one copy of each of a — 1 distinct 
elements; after that, it always considers the algorithm's strategy for the current 
element as a binary decision tree with 2(<t — 1) + 1 leaves, and presents the label 
of the deepest even-numbered leaf (except that it presents the <rth distinct 
element as the last element in S). It is not difficult to show that, for the right 
choice of a, the adversary thus forces the algorithm to use (ff + 2)n — o(n) 
comparisons. 

Theorem 5 In the worst case we need to make at least (H + 2)n — o{n) binary 
comparisons to sort S online and stably. 



PROOF. Suppose a = 2L lo sW lo s^)J + 1. Since a is 1 more than a power of 
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2, any binary tree on 2(cr — 1) + 1 leaves has an even-numbered leaf (in fact, 
some consecutive pair of leaves) at depth at least 

log(<7 - 1) + 2 > log a - + 2 = log a + 2 - o(l) . 

a — 1 

Therefore, since any distribution on a elements has entropy at most logo", 
some even-numbered leaf has depth at least H + 2 — o(l). It follows that the 
adversary forces the algorithm to use a total of at least (H + 2 — o(l))(n — a) = 
(H + 2)n + o(n) comparisons to process the cxth through (n — l)st elements of 
S. □ 



We have the adversary hold the cxth distinct element in reserve until the end 
because, this way, our lower bound holds even when the algorithm knows a in 
advance. If the adversary started by presenting all a distinct elements then, 
since the algorithm would know it had already seen all the distinct elements, 
from then on it could use a decision tree on only a leaves. 



4 Upper bound on ternary comparisons 

Shortly after Munro and Spira [11] and Fredman p[] published their results, 
Mehlhorn [§] published a generalization of Gilbert and Moore's [UJ. In this 
paper we do not need Mehlhorn's full result, only the weaker version below. 
We note that, although Mehlhorn's proof was not based on partial sums, 
Knuth p] quickly gave another that was. 

Theorem 6 (Mehlhorn, 1977) Given a probability distribution pi,...,pk 
containing only positive probabilities, in O(k) time we can build an ordered 
binary tree whose nodes, from left to right, are at depths at most log(l/pi), . . . , 
log(l/p fe ). 

Given the frequencies of the distinct elements in S, we can use Theorem [6] to 
build a (node-oriented) binary search tree whose nodes store the distinct ele- 
ments in S, with each distinct element aj at depth at most log(n/occ (o 3 -, S)); 
we can then insert each element into that tree using a total of at most 
(H + l)n + o(n) comparisons. Notice the first term in this bound is (H + l)n 
rather than nH because the number of comparisons used to reach and stop at 
an internal node is 1 more than its depth; the last comparison is performed at 
the internal node itself and indicates that we should stop there. Because the 
tree contains all the distinct elements in S, however, we need never perform a 
comparison at a leaf. 
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We can sort S using at most (H + l)n + o(n) comparisons even when we 
are not given the frequencies in advance, as long as a = o(n/ logn), by mod- 
ifying Fredman's algorithm to use Theorem [6] and ternary comparisons, in- 
stead of Theorem [T] and binary comparisons. Again, we need not do anything 
to process S[l]. Assume for the moment that we are again not concerned 
with how much time we use to sort S. To process S[l] for i > 2, we first 
apply Theorem [6] to the probability distribution occ (ai, S[l..(i — 1)]) /{% — 
1), . . . , occ (afc, S[l..(i — 1)]) j{% — 1), where ai, . . . , a& are again the distinct 
elements in S[l..(i — 1)]. Let T, be the resulting tree. We store a\, . . . , a& at 
Tj's nodes, from left to right, thus making it a binary search tree. Finally, we 
insert S[i] into Tj in the standard way, using ternary comparisons. If S[i] is an 
occurrence of some aj that has already occurred, then we eventually reach and 
stop at the node storing aj and, thus, use at most \og(l/pj) + 1 comparisons. 
Notice that, because Tj may not contain all the distinct elements in S, only 
those in S[l..(i — 1)], now we must also perform a comparison if we reach a 
leaf. On the other hand, if S[i] is not equal to any previous character and has 
rank j in S[l..(i — 1)], then we eventually reach a leaf storing either a 3 -_i or aj 
- i.e., either the predecessor or the successor of S[i] in S^l..^ — 1)] ■ - and, 
thus, use at most log(i — 1) + 1 comparisons. Essentially the same calculations 
as in the proof of Theorem [4] show we use a total of at most (H + l)n + o(n) 
comparisons. 

Theorem 7 We can sort S online and stably using at most (H + l)n + o(n) 
ternary comparisons when a = o(n/logn). 



Because Knuth's proof of Theorem 
the techniques described in Section 



is based on partial sums, we can reuse 
to make this algorithm run in 0(\oga) 



time per comparison, or 0(1) time per comparison when a = Oi^Jnj log n 



5 Lower bound on ternary comparisons 



Now consider any online stable sorting algorithm that uses ternary compar- 
isons to determine the stably sorted order of the elements in 5*. Again, since 
it is online and stable, it must determine each element's rank relative to the 
distinct elements already seen before moving on to the next element. Since 
it uses ternary comparisons, we can view its strategy for each element as an 
extended binary search tree whose internal nodes store the distinct elements 
already seen and whose leaves correspond to the intervals in which the next 
distinct element could lie. If the current element has been seen before, then 
the algorithm reaches and stops at the internal node storing that element, and 
the number of comparisons that it uses is at least the depth of that internal 
node plus 1. If not, then it reaches a leaf and the number of comparisons it 
uses is at least the depth of that leaf. 
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To see why we can consider the algorithm's strategy for each element as an 
extended binary search tree, first consider it as a ternary decision tree. Since 
the algorithm keeps the elements already seen in sorted order and middle 
children are reached only when a comparison returns =, the subtrees rooted 
at those children are all degenerate. Therefore, without loss of generality, we 
can assume that middle children are leaves and labelled with the distinct 
elements already seen, while leaves that are left or right children correspond 
to the intervals in which the next distinct element could lie. By deleting each 
middle child and storing its label at its parent, we obtain an extended binary 
search tree whose internal nodes store the distinct elements already seen and 
whose leaves correspond to the intervals in which the next distinct element 
could lie. 

Suppose an adversary again starts by presenting one copy of each of o — 1 
distinct elements; after that, it always considers the algorithm's strategy for 
the current element as an extended binary search tree with a — 1 internal 
nodes, and presents the element stored at the deepest internal node (except 
that it presents the crth distinct element as the last element in S). Essentially 
the same argument as in the proof of Theorem [4] shows that, for the right 
choice of a, the adversary thus forces the algorithm to use (H + l)n — o(n) 
comparisons. 

Theorem 8 In the worst case we need to make at least (H+l)n—o(n) ternary 
comparisons to sort S online and stably. 

PROOF. Suppose again that a = 2L lo s(«/i°g™)J + 1. Since a is 1 more than a 
power of 2, any binary search tree with a — 1 internal nodes has an internal 
node at depth at least log (cr — 1). Essentially the same calculations as in the 
proof of Theorem [4] show that log(a — 1) > H — o(l). Therefore, when the 
algorithm is processing the crth through (n — l)st elements in S, the adversary 
always chooses an internal node at depth at least H — o(l) and, so, forces the 
algorithm to use at least H + 1 — o(l) comparisons per element. Thus, the 
algorithm uses a total of at least (H + 1 — o(l))(n — cr) = (H + l)n — o(n) 
comparisons. □ 



We again have the adversary hold the crth distinct element in reserve until the 
end, because, this way, our lower bound holds even when the algorithm knows 
a in advance. If the adversary started by presenting all a distinct elements 
then, since the algorithm would know it had already seen all the distinct 
elements, it would not have to make a comparison at any internal node with 
two leaves as children, in particular the deepest internal node. 

Combining Theorems [7] and [8] yields the following corollary which, for a = 
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o(nj logn), tells us exactly how many comparisons are needed on average per 
element to sort online and stably as n goes to infinity. 

Corollary 9 When a = o(n/ logn) , in the worst case it takes an average of 
H + 1 ± o(l) ternary comparisons per element to sort S online and stably. 
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